Accessing Web Databases Using OGSA-DAI in BDWorld

Authors

  • Shirley Y. Crompton
  • Brian Matthews
  • W. Alex Gray
  • Andrew C. Jones
  • Richard J. White
Abstract

Interoperation of heterogeneous and autonomous data resources is a common data management issue for GRID applications. The OGSA-DAI [1] middleware addresses this need by providing a service-oriented framework, aligned with OGSA [2], to facilitate the access and integration of database and semi-structured data resources in the GRID environment. In the BioDA [4] project, we are evaluating the benefits of using OGSA-DAI in bioinformatics GRIDs by establishing communication between OGSA-DAI and GRID project developers, as well as through practical case studies involving current projects. In this paper, we describe our experience in applying OGSA-DAI R5 to one of these projects, BiodiversityWorld (BDWorld [5]), a GRID-based problem solving environment that specialises in the exploration and analysis of patterns in global biodiversity. BDWorld's database handling is characterised by the diverse types of databases used and the heterogeneity of the data, with respect to both data structures and standards. Many of these databases are also autonomous internet information resources that a client queries by keyword search rather than SQL. In contrast to the diversity of data resources used, BDWorld currently requires only a limited range of operations on these resources. One such operation is to create a study data set by aggregating data from iterative searches of remote data collections, using the same biological taxon object as the search parameter. This particular scenario requires accessing and integrating heterogeneous data from distributed collections and, we feel, should make an ideal test case for OGSA-DAI. There are two main ways in which we could introduce OGSA-DAI into BDWorld. One possibility is to leverage myGRID's [6] distributed query processing tool, OGSA-DQP [7], and use it to assist in planning the execution and distribution of the data-oriented parts of a BDWorld workflow. However, this would require a drastic revision of the BDWorld protocols [8]. Moreover, OGSA-DQP currently supports only popular relational database products, and most BDWorld data resources are not exposed as relational databases. Another option is to layer a virtual OGSA-DAI Grid Data Service over the existing BDWorld database wrappers. This design (Figure 1) preserves the existing BDWorld universal resource invocation mechanism (see [8]) and allows the existing database wrappers to be reused with minor modifications. This basic exemplar illustrates how OGSA-DAI could be modified to access web databases while leaving the data integration task to the BDWorld workflow system.
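
The study data set scenario described in the abstract can be pictured with a minimal Python sketch. It does not use the OGSA-DAI or BDWorld APIs: the `Wrapper` type, the `build_study_dataset` function, the two stand-in search functions, and all rows are hypothetical placeholders that mimic keyword-searchable collections with differing field names. The sketch only pools the rows returned for one taxon and tags each with its origin, leaving harmonisation of the heterogeneous fields to a later step, in the spirit of leaving integration to the workflow system.

```python
from typing import Callable, Dict, Iterable, List

# Hypothetical wrapper type: takes a taxon name, returns rows from one collection.
Wrapper = Callable[[str], Iterable[Dict[str, str]]]


def build_study_dataset(taxon: str, wrappers: Dict[str, Wrapper]) -> List[Dict[str, str]]:
    """Query every collection with the same taxon and pool the returned rows."""
    dataset: List[Dict[str, str]] = []
    for source, search in wrappers.items():
        for row in search(taxon):
            # Copy the row and tag it with its origin; the collection's own
            # field names are kept as-is (no schema harmonisation here).
            record = dict(row)
            record["source"] = source
            record["taxon"] = taxon
            dataset.append(record)
    return dataset


# In-memory stand-ins for remote keyword-search collections (placeholder rows).
def specimen_search(taxon: str) -> List[Dict[str, str]]:
    rows = [
        {"name": "Vicia faba", "locality": "placeholder site A"},
        {"name": "Pisum sativum", "locality": "placeholder site B"},
    ]
    return [r for r in rows if r["name"] == taxon]


def checklist_search(taxon: str) -> List[Dict[str, str]]:
    rows = [{"species": "Vicia faba", "status": "placeholder status"}]
    return [r for r in rows if r["species"] == taxon]


if __name__ == "__main__":
    wrappers = {"specimens": specimen_search, "checklist": checklist_search}
    for record in build_study_dataset("Vicia faba", wrappers):
        print(record)
```

In this sketch each collection is hidden behind a plain callable, which loosely mirrors the idea of reusing existing wrappers behind one common invocation path; a real deployment would replace the stand-in functions with calls to the actual services.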


Similar resources

Accessing Bio-databases with OGSA-DAI - A Performance Analysis

Open Grid Service Architecture Data Access and Integration (OGSA-DAI) is a middleware which aims to provide a unique interface to heterogeneous database management systems and to special types of files such as SwissProt files. It could become a vital tool for data integration in life sciences, since the data is produced by different sources and resides in different data management systems. With it,...


Design and Implementation of OGSA-DAI-RDF

This paper presents the OGSA-DAI-RDF middleware that extends OGSA-DAI access to RDF database systems, e.g., Sesame and Jena. Several OGSA-DAI activities for handling RDF data and ontology are implemented. The query language interface is based on the SPARQL query language. Introduction: The National Institute of Advanced Science and Technology (AIST) of Japan started a 5-year project called AIST-SO...


Grid Enabling Your Data Resources with OGSA-DAI

OGSA-DAI (Open Grid Services Architecture Data Access and Integration) provides an extensible software framework allowing data resources, such as files, relational and XML databases, to be exposed through Web services acting within collaborative Grid environments or, more modestly, in stand-alone mode. OGSA-DAI may be deployed to WSRF-based platforms, such as the Globus Toolkit 4, as well as no...


OGSA-DAI Usage Scenarios and Behaviour: Determining good practice

OGSA-DAI has been developing Grid middleware for over two years now. A high-profile project within the Grid community, OGSA-DAI is increasingly being used by Grid-based projects to meet their Data Access and Integration (DAI) requirements. From a simple set of services, relatively sophisticated usage scenarios may be realised. This presentation examines a number of DAI scenarios identified by ...


Experiences with OGSA-DAI: Portlet Access and Benchmark

Portals have proven to be useful client-side applications for providing user-oriented services for accessing the grid. Grids are increasingly being used for collaborative work within the scientific community. The job processing time for high performance computations can be reduced by the usage of computational grids, with its wide availability to resources. Grid users would similarly benefit fr...




Publication date: 2005